Model Selection, Hypothesis Testing, and Risks of Condemning Analytical Tools
نویسنده
چکیده
Model selection based on information-theoretic methods (Burnham and Anderson 2002) has gained prevalence as a strategy for analyzing ecological data, especially among wildlife biologists (Stephens et al. 2005). Development of this alternative strategy has been refreshing because it has encouraged many of us to reexamine the analytical strategies we use and especially to evaluate models built explicitly on a biological foundation. However, some proponents now advocate model selection as the only reasonable strategy for a wide range of analyses. In particular, there is growing sentiment that strategies based on the framework of hypothesis testing are now systematically inferior or inappropriate. Because all analytical strategies have strengths and weaknesses, all can be misused. My goal is to suggest that analysts make an informed choice from all available strategies, employing each in contexts where they are most informative, as the most appropriate strategy will always depend on the specific analytical context. For example, authors of recent texts on distance sampling (Buckland et al. 2001, 2004), an estimation framework rooted firmly in model selection, and authors of a monograph written by acknowledged experts in model selection (Burnham et al. 1987), use hypothesis tests routinely when they are appropriate and informative. Application of a single analytical strategy in all circumstances is inappropriate. Failure to consider important parameters or to obtain sufficient data limits all analytical strategies, including model selection. If sample sizes are small and precision of estimates is low, the ability to distinguish among candidate models will be weak, analogous to having low statistical power for traditional hypothesis tests. Selecting a model solely because it satisfies a fixed criterion such as the lowest Akaike’s Information Criterion (AIC) is potentially as naı̈ve an approach as rejecting a null hypothesis of zero effect on the basis of a P-value being marginally less than 0.05. In the absence of additional information such as estimated effect sizes, these approaches are too simplistic to be uniformly reliable as endpoints of an analysis. Model selection can be an elegant and effective strategy to distinguish among candidate models based on data. Reliance on a small, fixed set of candidate models, however, presents an additional potential liability for model selection. If the set of candidate models is incomplete, the resulting inferences will be unreliable (Burnham and Anderson 2002, Johnson and Amland 2004). Therefore, in cases where a strong set of candidate models is not or cannot be developed, model selection may be less effective and less informative than approaches based on hypothesis testing. Consequently, model selection requires more information about the system of interest because inferences are contingent on this set of candidate models. The process of developing candidate models is a strength of model selection when done well and a weakness when done poorly. Although I routinely use both approaches, colleagues and reviewers have suggested increasingly that I use model selection instead of hypothesis testing in contexts where I felt hypothesis testing was more appropriate. Similarly, I have observed authors using model selection when their analyses may have been better informed by hypothesis tests. In general, I find hypothesis testing to be a more informative strategy when there is insufficient information available to formulate a strong set of candidate models. I will use a hypothetical study of habitat associations to illustrate a context where hypothesis testing may be more informative than model selection. Although wildlife ecologists often are chastised for assessing pattern too frequently, these studies are commonplace and can be valuable in the appropriate circumstances. Typically, the short-term goal in these studies is to identify environmental features that explain variation in abundance of a species. The long-term goal often is to identify habitat features that managers can alter to meet a conservation or management objective for a species. On a series of study plots, assume that we estimate abundance, perhaps using a distance-sampling or capture– recapture framework where we choose a model for estimation based on model selection. We then assess associations between abundance and the environmental features measured on each plot. Because we have little a priori information as to which features may be influential, we measure 12 different environmental features that span a range of physical and biological elements. We then seek to identify a subset of features that is closely associated with abundance. To use model selection in its intended spirit, we would need to develop a set of candidate models that potentially link abundance with habitat features that we can differentiate objectively through an information criterion such as AIC. In circumstances where there is a solid foundation for 1 E-mail: [email protected]
منابع مشابه
Quality Function Deployment Method for Selection of Effective Management Tools on Setting EFQM Model
Using the effective management tools that are relevant tothe organization’s needs for excellence has become so important for thecompanies to improve their performances and then increase customersatisfaction and gain market shares. Quality function deployment is anefficient and powerful tool in design, development, and planning of products.The main function of quality function deployment is conv...
متن کاملA New Approach to Project Risk Responses Selection with Inter-dependent Risks
Risks are natural and inherent characteristics of major projects. Risks are usually considered independently in analysis of risk responses. However, most risks are dependent on each other and dependent risks are rare in the real world. This paper proposes a model for proper risk response selection from the responses portfolio with the purpose of optimization of defined criteria for projects. Th...
متن کاملLINEAR HYPOTHESIS TESTING USING DLR METRIC
Several practical problems of hypotheses testing can be under a general linear model analysis of variance which would be examined. In analysis of variance, when the response random variable Y , has linear relationship with several random variables X, another important model as analysis of covariance can be used. In this paper, assuming that Y is fuzzy and using DLR metric, a method for testing ...
متن کاملبرآورد دوزهای هدف با استفاده از روش MCPMod در مطالعات دوز-پاسخ
Background and Objective: Determining adequate model and estimation of target dose with high precision is a key goal in dose-response studies. MCPMod is a new method that used for model selection and estimation of target doses and unlike other methods it does not have any limitation. In this method, selection of models has performed by hypothesis testing also increasing responses that are a cr...
متن کاملThe Analysis of the Existence of the Hypothesis of Adverse Selection on the Relationship between Off-balance Sheet Items and the Bank's Risk
Balance sheet itself does not specify and show all the activities that a bank pays. Because banks can do many swap contracts and obligations, exchange, and commitments Outside of the balance sheet. To such activities and exchange that will not appear on the balance sheet, are saying off-balance sheet activities. These items are usually reported in the notes to the attached financial statements....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007